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At the meeting of the Practical Linguistics Group here in St, Louis two 
years ago* I was pleased to outline the cheory behind the Freshman English pro- 
gram at Cleveland State. That theory is based on the observation that a student's _ 
intuited linguistic knowledge can be made usaful to him if he can comprehend 
it direccly, if he can get it out of his left hemisphere and into the hand that 
holds his pen. The argument is simple: since a student already uses metaphor, 
logic * syntactic variety and embedding , and most of tlic other elements of lin- 
guistic structure, ha needs to learn how to control these elements and to adapt 
them to the conventions of writing* 



I said, too J that we were planning to investigate the syntactic and rhe-- 
torical structures that students write, and speculated chat there might be a 
quantitative measure of ''fluency^' for syntax and rhetoric, I-m hera today to 
report the ragults of the first phase of that investigation. 

From the beginning, we understood the need to identify and describe the 
optimally fluent English sentence. Such a datum would allow us to describe 
the ways in which student sentences differ from the "average" professional 
sentence and provide the^ facts^ perhaps, on which a goal-directed instruction 
could be based. By sampling a million sentences from representative prose 
writers, one could unearth, let's say, the ''average sentence" of respected 
writing* One might begin this task by calculating the average number of words 
per sentence Then one might figure out how many embeddings there are per 
sentence (relatives. Infinitives^ Indirect questions, etc,) Next one could count 
the noun phrases, prepositional phrases and clauses. Over on the rhetoric sidej 
though the problems are different and more difficult because the- basic rhetorical 
imit Is sometijaes larger, sometimes smaller than the orthographic unit, one could 
count santence connectors ^ Inversions, analogies, conscious metaphors, latlnate 
words and the rest, and get an average number of these per rhetorical unit. 
So then we would have some numbers i for syntaKj say* 13 words per orthographic 
unit, 1*3 clauses, 1.0 embeddings per clause; and for rhetoricp ,25 sentence 
connectorsi .10 inversions, .05 analogies, *05 conscious metaphors , 22.0 % 
latlnate forms . , * 



1^ 



Unfortunately, the eKercise wauld give statistical anthropological data 
whose explanatory power was feeble. Wiy? For one thing, no senteace of English 
could be concocted that had 1.2 clauses. Still, we eould determine to what ax** 
tents the average freshmai sentence deviates from the average professional 
sentence. For another, the standard average English sentence (SAES) wouldn't 
tell us much about why one author writes effectively but another does hot ^ 
Effective writing depends importantly on variety r more than two or three ; 
sentenees in a row with thirteen words In each aakes for bad writing, Besides, 
syntactic ow rhetorical complexity leads to poor writing as often as to good 
witting. Eheiorical complexity itself sometimes weakens the proi^. 
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At this pointy It would be tempting to expand the analysis. For instance, one 
might attempt to find out what sentence-length variety i£* lowing thiss we 
wouldn-t be constrained merely to observe that student sentences deviate from 
standard average length. We might write another program (because our imaging 
was sloppy) that deftiies average sentence length as anything from 9-17 (4 on 
either side of 13), .short as anything from 1-8, and long as any^thing above 17— 
and ask the computer to array sentence length in three ways and\tb\ calculate the 
Intervals, We could continue to elaborate the analysis^ down to gatting cdeffi- 
eients for the relationship between the incidence of relative clauses, say^ 
and logical connectors. We could dissect to the bone and bare the marrow^ but 
would such calculations tell us what a good sentence is, or paragraph? No* 
We would end up as the political analyst does, knowing everything there is to 
laiow about the voter except how he or she will vote. Nevertheless , there are 
some temptations to go about the investigation in this manner^ and some justi- 
fication. If we remljid ourselves that our first need is find out what the SA£S 
is, not the "good" English sentence, then the task needs no other justification. 
The proper questions arei How many words are there in the SAIS, how many clauses, 
how many embeddings? 

Let me just mention here what I consider to be the most troubling pedagogi- 
cal problem; Suppose in a thoroughgoing analysis of the syntaK and rhetoric of 
renowned writers and of fresteien we found such comparative facts as the ones 
displayed in the chart follo^ng. 



itructures 


Profassional 
Writing 


Freshman 
Writing 


Average number of sentences per paragraph 


10 


5 


Average number of words par paragraph 


200 


50 


Semantic field of nouns 


i • narrow 


wide 


Corralatlon of numbers of persuasive sentences 
and Illative relations 


, • high 


low 


Corralation between numbers of parsuasiva/lllative 
sentences and additive/illustrative santencea. 


. * , high 


low 



Wiat might wa do with such relationships, padagoglcally? How might we Int^rprat 
tham? All wa might do with tha facts Is fit thaa Into available thaoify ^of 

tha teaching of composition skills^ or chaorlEe anaw, (Hie question Aout^^ 
Intarpratation Is aubaaquent* I thli&, to further fact gatherlngv) Ilia pravious 
questleri la tha one that has Intarasted ma, namely, what la the syntactic and 
rhatorleal structure of a sa^la of frashman writing? Daspite the objactlons 
that I hava just raised. It was this quastion that Interested us./ I wish to 
ralata how wa want about answering Itv * ^S- / 



Wa analyzed a sa^le of fraslman writing to determine .what the quantltiai-^lsl^ 
were. And we have set the analysis of professional writing aside* The >ample;^?^^^ 
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SrM't eo^osi.iona ,,,,,,,, 

toward the end of our 0.0 quar"ar p^o^'a' No "udents) wrlttan 

to get a valid sample, though It did thoroughgoing effort was made 

vomen as men under roughly ?be same ^^"^^"^^ by as many ^ 

« thought to reduce' lln°Znlli:roET.\l 'lV'' ',1"/ ' "" o'clock, 

controlled at alli we do nof ? cne sample. No dependant variables were 

ho„ ..ny books th=v ^Ld? 5™ "tJL'^J.^S^f '"^ -u.s.s/ 

this. ^ -^^ watched, or anything eisE like 

syntaSe'unitf thf^Ju^f - i-l^ted the basic 

word Imperatives to fwi^ ^.^Wt^'^—I^^ °- ^ ^ 

laeh clause in each sentence "f'ae-d-''^ The turtle gave the twig a tug." 
and so forth. We listed the ellbora-.-^r "^.-"^ syntactic tvpe-V. N LV N, 

adverb, prepositional phraaj ^relatl^e el? "^uctures that we would records 

phrase, etc. (AopendlK A) Selaht"ier« ,f f ^^"^^ complement, infinitive 
ad hoc fashion,' but we filt^lnT-i^^^^^ f^^* in an 

Composition #1 
Clause #6 
W^da 16 
Type N LV ADJ 

, Elaborations. ^J-b .^predi^ ^^^^^^^^ ^^^^^^^ 

This Information was punched on cards and stored on tape. 

countS%S%oir^Sr.J«or -c^ SturelM" SlMSHaM^. We 
(". . .people were MgMgd In^fstreennri^ 'u"^ parallel structure 

was a friend, a l ^r! a mLage ^ n ^ ^ ''°T^^^^^'^'' asyndeton ("...he 
j«ts, and certain PraposlSSi 'phrLer F ai^^^^ ««bje«s.. verbs. ob= . 
appositlves ("...his uncle, a taxid.r^iL '-^ifSa I know nothing about. ..•') | 
idrives his Volvo (^yaO A S vof Sif Maine...") I parenthesis C. . he 

«Uipsls (. . .she bunting h ^ .""l Aatorlcal • 

neaia..., anaphora, conscious metaphor, analogy. 
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and antithesis .* Also, we have marked the logical relationships between 
successive units of tlought: addicive, adversative, illative. ' causative," illustra- 
tive, alternative, causal or nil. 

Then the computer. Using SPITBOL (Version 2.2) we wrote an arithmetic 
program that could report averages per composition and set for the items "counted 
py clause, by orthographical unit, and by paragraph.) We wanted to know fo^ 
instance, what the average number of embeddlngs per clause was. what the average 
number of parallel structures per paragraph was, etc. Computers are good with 
tnis sore of thing. We got our numbers (Appendix B) . 

Computers are better at a related accivity— statistical evaluation. We 
used two different evaluation procedures-. We submitted our numbers first to 
the SPSS (Statistical Package for the Social Sciences), and discovered that soma 
correlations were nearly perfect: when we applied T-tests to the means and 
variances that the SPSS provided we proved that the raw statistical differences 
were not significant. There's only one chance in a thousand, for example, that 
tne numoer oi embeddlngs per clause will not go uo as the number of words per 
clause goes up. Similarly, we would have got 0.91 embeddings for one set and 
1.15 for the other (the actual figures) 50% of the time, that is, just by sa-ipling 
error. In this case as in similar statistical cases, 0.91 equals 1.15. 

Pearson Correlacion Coefficients, cha second of our evaluation techniques 
verified that the factors we measured correlated positively with each other 
Clauses with few words in them tended to have few syntactic elaboration features 
Aow gross weights and low embedded weights. And vice versa. (There were some ' 
surprises which we have noced but have not yet evaluated. For example the 
incidence of syntactic elaborations that are not embeddings correlates 'with the 
Incidence of conjunctions and with the incidence of embeddings themselves. 
But the incidence of conjunctions does not correlate with the incidence of 
embeddings. Why the first and not the second?) 

In all, there were 612 orthographic units analyzed or 17 per composition 
(four compositions having been discarded for different reasons.) There were 
806 clauses, or 1.30 clauses per sentence. The mean number of words per clause 
was 13.434, and the median was 11.847. There were 3.810 elaboration structures 
per clause, the median being 3.117. The mean and median figures for embeddings 
per clause were 1.026 and 0.590, respectively. The average weight score was 
3.590, embedded weight accounting for 1.503 of this. The median weight score 
was 2.676, and the median embedded weight was. 0.72 7. 

clauses that the students wrote come very near these norms: 

...it is not possible to compare football with the brutality of gladiator 
fighting. 

(13 wordsi 4 elaborations.: predicate adjective complement, prepositional 
phrase, prepositional phrase, noun n&uni lembeddlngi gross weiahc 
3.25; embedded weight, 1.25. ^ ^' - * ' 

the above paragraphs are an Illustration of what I have experienced 
thus far in college, ^ : 

(15 wordii 4 elaborations! prepositional phrase, 'what' nominallzationVi 
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wjust now. In fact, we are trying to straighten out dif f iculties"-ln iwrking cheae^ 
^-■"Mt four. They are not easy to identify oblectively.KitinamblguTusiyil^jlf^^^ 
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adverbs prepositional phrase * 1 embedding j gross weight j 3,25| embedded 
weights 1.50* 

A sentence like "High school offers no challenge to brighter students'' illustrates 
Che unelaborated clause with trnxo score (also succinctness , a virtue which 
will go unrecognized until we have sophisticated the analysis considerably.,) 
A sentence like the one below illustrates the syntactic earthquake (above 7.6 
on the Jespersen scale.) 

1 feel I must support Gleason's statement about a person giving himself 
away when he talks by tying in some aspects of his talk with perhaps his 
age, more than likely his education by choice of words and proficiency 
by which the talk is delivered and perhaps social background by maybe 
relating to some of his ways of life or lives of others that he knows. 

The rhetorical analysis is presently in its final precomputer stage. The 
rhetorical features and logical relations have been marked , but the cards have 
not been punched. The only statistical values that can be reported today are 
those which we have estimated with paper and pencil^ as follows. Analogy * poly-- 
syndetonj syntactic inversion j simile and apposition seem rare. We get one 
analogy and one asyndetic series every six compositions. There are 0,8 syntactic 
inversions per composition, one appositive, and 0,3 similes, JParallallsms are 
more coMon^ about 1.6 of these per composition. But antithetical structures 
show up Just 0.7 times. This same frequency is recorded for anaphoric eKpressions. 
Students use conscious metaphors 3,8 Lilies ^ on the average, in their themes. 
Cliches (mercy me!) come in twoSj twice blessed. 

The variance in the incidence of logical connectors appears to be eKtremely 
wide. Some students use them abundantly (1,7 times per sentence!) ^ others not 
at all (zero times per composition!) Perhaps it is herej more than anywhere 
else, that statistical averages will turn out to be the most misleading. 

Students combine logical relations in apparently boundless permutations. 
Thmy explain points previously made in whole sentences immediately following, 
and in series of sentences which, among themselves, express various othtr ralations 
like additive, illustrative, adversative. They use illustrati^es as eKplanation, 
adversatives as alternatives ^ addlti%'es^as illustrations, rneir "logic," when 
it falls apart > does not break up bacayse of the order of the elements necessarily, 
but because of their conduct of them. As yet,, the program is capable of revaal^ 
ing little about what standard orderliness is. shall see what it is perhaps ^ 

as we look more and more closely at how students arrange the parts of their ar- 
gument and at the arrangements that prof eisionals use » 

la these remarks, I have wanted to persuade; you that the task of maasuring 
quantities ©£ cQmpositional structures appears to be both arduous and andlass, 
(I am ready to accept volunteers for the jab of analyzing a large sample ^f pro- 
fassional writing.) I Irave also wanted to suggest that the itructures we lave 
bean counting and the quantitative comparisons we have thought to make ma; not 
be the ones which best reveal the key attributes of syntaetlc/rhetorlcal style or 
which decisively eKpose the differences between professional witing and freshman 
witing, I am not convinced that a quantitative representation of prose can - 
divulge the secrets which it has kept from us* Mor am I convinced that the 
revealed secrets will help us to train writers* But I can tell you with enthu- ' 
slaam that I would like to haar your vlewa on these matters* 
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APPIMDIX A 



Weight 


Coc 2 


Elaboration 


0.25 


AVD 


Single Word Adverb 


0.50 


AJAJ 


Adjective + Adjective 


0,50 


NON 


Noun + Noun 


0,50 


■ PDSV 




0,75 


ADJXX 




0,75 


AVBXX 


Adverb'' a T_ PHt^^^^ TH/rjo T ft"^ms^ 


0.75 


EtCNJ 




1.00 


COi-ffAR 




1.00 


mm 


NOtm H" Nntiti ^ Mnim 


1.00 


PAPRT 


Pas t PaT" t"i r» T TS'f mi PH i^a 


1*00 


PRSPT 


PresBnt Par tleinlal PViT^ae^ 


1.25 


ADJYY 


«^j^wi«j-v&i ^ ii-j^O'^^ ijr ps ^teQuCBQ Aej.atXVe c Js^use f etc,/ 


1.25 


AVBYY 


Adverbial PHra^p Tvn^ TT ^r^aites 1- 
nuv^^w^ao. j.ype ii ^Qause^ uXieccxonj 3,ns cruiuent g ' 








1,25 


PAC 


Predicate Adi act iv^ ConiD ipmpn ^ 


1.25 


PSS 


Paisive 


1.25 


VPC 


Verb Phrase Complement 


1.50 


FOR 


For-to Nominaliaation 


1.50 


im 


-ing Nominaligation ■ !. 


1.50 


NPHC 


Noini Phraae Complement t 


1.50 


EEL 


Relative Clause |^ 


1.50 


THAT 


That NominaliEation 


1.50 


WH 


What Nominaligation 


1.50 


Emz 


Double En^edding 


3.00 


mm3 


Triple Imbedding 


6.00 




Quadruple Embedding - 


12.00 


nms 


Quin tuple Embedding 


1.50 




Extra l^ight For Nominalizations Used As Subjgets 



APPENDIX B 

The Syntactic Structure of the Clause 
in Freshman Writing 
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Number o£ 
worii 


Number. of 
EniDeaaings 


ESffiSraf ioni 


ii2|it 


untied 


Bean 
median 
range 
'variance 


13.434 
11.847 
66.000 
60.557 


1.026 
0.590 
10.000 
2.232 


3.810 
3.117 
26.000 
8.887 


3.S90 
2.676 
27.750 
W.790 


1.S03 ' 
0.727 
15.000 
5.047 




